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ABSTRACT 

Aim: In this study, genetic polymorphism of two tRNA-liked short tandem repeat (STR)-containing loci, R-R and S-Q, 
was analyzed in order to clarify further the genotypic differences among E. dispar isolates. 

Background: Entamoeba dispar is closely related to the human pathogen E. histolytica, the agent of amebic dysentery 
and amebic liver abscesses. E. dispar is, to some extent, capable of producing variable focal intestinal lesions in animals 
and of destroying epitheUal cell monolayers in vitro, and some have reported it to be capable of producing amoebic liver 
abscess in hamsters. However no evidence exists at present to link E. dispar with human disease. 

Patients and methods: A total of 28 E. dispar samples from gastrointestinal disorder patients were characterized using 
PCR and sequencing. The sequences obtained were edited manually and aligned. 

Results: sequence analysis showed 9 and 6 different pattems of units in the repeat-containing region of R-R and S-Q, respectively. 
The repeat-containing regions of R-R and S-Q loci were found to be extensively polymorphic, varying in size, number 
and order of repeat units. 

Conclusion: The results demonstrate extensive genetic variability among Iranian E. dispar clinical isolates. The genetic 
diversity of tRNA gene-linked STR loci shows them to be suitable for epidemiological studies such as the 
characterization of the routes of transmission of these parasites in Iran. 
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Introduction 

In 2005, the genome sequence of the 
Entamoeba histolytica strain most widely used in 
research laboratories, HM-1:IMSS, was completed 
(1). One of the striking findings was the 
abundance and unique organization of the tRNA 
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genes (2). Over 10% of all the sequence reads 
contained tRNA genes and almost all were 
organized in tandem arrays. The inter genie regions 
are rich in A+T (about 80%) and contain non- 
coding short tandem repeats (STRs) (3). Diversity 
in these loci among E. histolytica strains are 
mainly due to varying numbers of STRs (4). 

When a comparison is made between E. 
histolytica and E. dispar the loci vary not only in 
the number but also in the sequence and 
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arrangement of STRs (5, 6). Ali et al. (2005) have 
described a system for typing E. histolytica strains 
based on 6 of the tRNA gene-linked STRs (7). 
Using this method, a recent study of E. histolytica 
samples collected from asymptomatic, 
diarrhea/dysenteric and amebic liver abscess 
patients in Bangladesh revealed that the parasite 
genotype prevalences identified in the 3 groups 
were significantly different from each other, 
suggesting that the parasite genotype plays a role 
in the outcome of infection in humans (8). 

E. dispar is closely related to E. histolytica 
but has never been documented to cause disease 
in humans, although a recent report suggests 
that a few strains may be able to produce liver 
abscesses in hamsters (9). Genetic variation in 
E. dispar has not been widely studied to date. 
To investigate this, we have been comparing the 
structure of STR loci in E. dispar using strains 
from Iran. In this study, genetic polymorphism 
at two tRNA-hked STR-containing loci, R-R 
and SQ, was analyzed in Iranian E. dispar 
isolated from patients with GI symptoms using 
PGR and sequencing methods. 

Patients and Methods 

A total of 28 E. dispar strains were analyzed. 
Clinical information on the samples is given in Table 
1. All the samples used in this study were diagnosed 
as positive for Entamoeba spp. by microscopic 
examination of fresh stools using direct smears, 
formalin-ether concentrated, and trichrome stained 
specimens (10). 

The genomic DNA was extracted directly 
from stool and samples were identified as E. 
dispar by loci D-A and A-L based PGR analysis, 
as previously described (11, 12). For genotype 
analysis, loci R-R and S-Q were amplified by 
PGR using two E. dispar specific pairs of 
oligonucleotides previously described (7). PGR 
products were analyzed by electrophoresis using 
1.8% agarose gels (Fermentas, #R049I) in Tris- 



boric acid-EDTA buffer containing ethidium 
bromide after which the gels were 
photographed under ultraviolet light (UVIdoc, 
UVItec Limited, Cambridge, United Kingdom). 
The PGR products were sequenced using the 
amplification primers and an Applied 
Biosy stems (ABI) BigDye® Terminator V3.1 
Gycle Sequencing Kit, and analysed on an ABI 
3130x1 Genetic Analyzer. 

The sequences obtained were edited 
manually and aligned using Gene Runner 
software (version 3.05). Nucleotide sequences, 
except for the forward and reverse primer 
regions, were aligned with the only previously 
available locus R-R and S-Q sequences from E. 
dispar in GenBank (EF421343 and AY842971). 
All new sequences were submitted to the 
GenBank/EMBL/DDBJ database under 
accession numbers HQ439911-30 for S-Q Loci 
and HQ439959-83 for R-R loci. 

Results 

Polymorphism in nucleotide sequences of the 
noncoding STR locus R-R 

In order to better understand the nature of the 
polymorphisms among the Iranian strains, we 
amplified and sequenced the individual products 
fi-om locus R-R Irom all 28 isolates. PGR of the 
Iranian isolates amplified a product of between 586 
and 726 bp in all 28 samples. Sequences revealed a 
complex inter-isolate polymorphism in length, 
location, and number of the repeat units (Figure 1). 
Based on the sequences of locus R-R, the 28 Iranian 
isolates were divided into 9 distinct types (a to h), 
with genotype 'h' being the dominant type (35.7%). 

Polymorphism in nucleotide sequences of the 
noncoding DNA locus S-Q 

PGR of locus S-Q in Iranian isolates amplified a 
Iragment of between 381 and 437 bp in 20 samples. 
No amplification was observed at this locus in 8 
isolates. Polymorphisms in the type, location, and 
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number of repeat units were observed in the repeat- 
containing region of locus S-Q (Figure 2). 
However, locus S-Q appeared to be less polymorphic 
than locus R-R. Based on the nucleotide sequences 
obtained, the 20 Iranian isolates were divided into six 
distinct S-Q types (1 to 6), with genotype 5 being the 
dominant type (30%). 

Table 1. Background of E. dispar isolates 
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Figure 1. Schematic representation of the STR 
polymorphisms in locus S-Q of E. dispar. The 6 
distinct sequence types are shown as well as the 
identification tag for the isolates that matched each 
type; also shown is the structure of locus S-Q 
sequence in the standard isolate, E. dispar SAW760 
{AY842971). The sequences of each of the 3 repeat 
types are shown beside their corresponding colored 
block. The conserved non-repeated regions are 
shown as a single line. 



Discussion 

The ability to identify strains of Entamoeba 
dispar may lead to insights into the population 
structure and epidemiology of the organism. 
Zaki et al. (5, 13) showed that using two STR- 
loci in combination allowed differentiation of a 
majority of the E. histolytica and E. dispar 
isolates studied based on product size, and they 
proposed that these loci had the potential to be 
used as polymorphic molecular markers for 
investigating the epidemiology of these 
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organisms. Independently and using a repeat 
region of a gene coding for a serine-rich antigen 
(SREHP), Ayeh-Kumi et al. (2001) showed the 
majority of E. histolytica could be differentiated 
using the repeat region of this gene and that 
samples from liver abscess patients had 
polymorphisms which were not present in the 
intestinal isolates from the same geographic area 
(14). 
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Figure 2. Schematic representation of the STR 
polymorphisms in locus R-R of E. dispar. The 9 distinct 
sequence types are shown as well as the identification tag 
for the isolates that matched each type; also shown is the 
structures of locus R-R sequence in the standard isolate, E. 
dispar SAW760 (EF421343). The sequences of each of 
the nine repeat types are shown beside their corresponding 
colored block. The conserved non-repeated regions are 
shown as a single line. 

From 79 E.histolytica strains isolated from 
patients with and without symptom in at least 8 



different countries, Haghighi et al. (2002, 2003) 
sequenced PCR products from 4 loci. Limited 
PCR product size variation has seen in the 
chitinase gene but fairly superior sequence 
variety, with a total of 9 sequence types and an 
identical number of predicted peptide 
sequences. Thirty seven sequence types and 31 
different peptide sequences seen in the majority 
of polymorphism with SREHP gene. The two 
tRNA-linked STRs showed intermediate 
polymorphism with 13 to 15 sequence types. 

Outside of the repeat regions in the 4 loci, no 
single nucleotide polymorphisms (SNPs) were 
detected, and also there was no correlation 
between clinical and sequence types (15, 16). 

As the studied populations in the studies of 
Haghighi et al. collected from different 
geographically regions and obtained during 
consequences years, they did not detect a link 
between genotype and symptoms. In contrast 
Ali et al. (2007) found dissimilarity among 
sample groups used a geographically and 
temporally control group (8). Eventually any 
correlations would be lost as new genotypes 
arise in the population due to instability of 
criteria that can measure. Therefore, they must 
be mutating to generate the diversity in 
populations. 

In the current study the links between 
diversity and virulence have shown using size 
and sequencing of tRNA-linked STRs. 

Flow samples of stool and liver abscess from 
18 ALA patients from USA, Italy and 
Bangladesh showed that the genotypes of the 
intestinal amebae were dissimilar to the 
corresponding flow samples of the same patient 
(3). The mechanism of this result is vague, but it 
suggests that only small numbers of multiple 
genotypes in intestinal could migrate to the liver 
or recombination of DNA procedures are 
arranged prior to or during migration of the 
amebae from the intestine to the liver. 
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The genetic polymorphisms in loci D-A and 
A-L in 28 isolates of E. dispar from three 
different geographic regions of Iran were 
previously studied using PCR and sequencing. 
The sequenced products revealed 12 and 7 novel 
E. dispar genotypes respectively and showed the 
loci to have the potential for use in 
epidemiological studies, such as the identifying 
the routes of transmission of these parasites in 
Iran (11, 12). 

Use of an additional two targets for 
amplification in this study (loci S-Q and R-R) 
means that fingerprinting of 28 Iranian isolates 
has been completed at four loci (table 2). 

Sequence analysis showed 9,12,7 and 6 
different patterns based on variation of units in 
this repeat containing region of R-R, D-A, A-L 
and S-Q loci, respectively. 



The tRNA gene regions in loci S-Q and R-R 
similar to loci D-A and A-L are conserved and 
are the site of the primers used, but in the 
middle there are repeat units of between 6 and 8 
nucleotides which vary among isolates. 

Elimination, duplication and substitution of 
units in this repeat-containing region are the 
basis of polymorphisms detected in the two 
species. 

By the simultaneous investigation of locus 
A-L and locus D-A (11, 12), 26 subtypes out of 
28 E.dispar isolates were distinguished (the 
molecular patterns of NH19IR and NH20IR, 
also NHM2IR and NHM3IR are not different in 
two loci). 

NH 19IR and NH 20IR isolated from Tehran, 
SHN 3IR and SHN 4IR isolated from Zahedan, 
NHM IIR and NHM 2IR isolated from Gonbad 



Table 2. Fingerprinting of 28 Iranian isolates at four loci (S-Q , R-R, A-L and D-A) 
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and also NHM IIR and NHM 4IR are similar in 
3 loci and differentiation of those isolates based 
on fourth loci are possible. 

The results demonstrate an extensive genetic 
variability among E. dispar clinical isolates. The 
repeat containing regions of R-R, D-A, A-L and 
S-Q loci was found extensively polymorphic in 
size, number and also in order of repeat units. 
The genetic diversity of tRNA Gene-Linked 
short Repeat loci shows them to be suitable for 
epidemiological studies. 

In conclusion, we propose that molecular 
typing and analysis of genotypes of E. 
histolytica and E. dispar isolates from a variety 
of locations will help in determining the 
geographic origins of isolates and routes of 
transmission. 
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